First-Order Algorithm with O(ln(1/ε)) Convergence for ε-Equilibrium in Two-Person Zero-Sum Games

Authors

Andrew Gilpin

Javier Peña

Tuomas Sandholm

Abstract

We propose an iterated version of Nesterov’s first-order smoothing method for the two-person zero-sum game equilibrium problem $\min_{x \in Q_1} \max_{y \in Q_2} x^{T} A y = \max_{y \in Q_2} \min_{x \in Q_1} x^{T} A y$. This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an ε-equilibrium to this min-max problem in $O\left(\frac{\|A\|}{\delta(A)} \ln(1/\epsilon)\right)$ first-order iterations, where δ(A) is a certain condition measure of the matrix A. This improves upon the previous first-order methods, which required O(1/ε) iterations, and it matches the iteration complexity bound of interior-point methods in terms of the algorithm’s dependence on ε. Unlike interior-point methods, which are inapplicable to large games due to their memory requirements, our algorithm retains the small memory requirements of prior first-order methods. Our scheme supplements Nesterov’s method with an outer loop that lowers the target ε between iterations (this target affects the amount of smoothing in the inner loop). Computational experiments in both matrix games and sequential games show that a significant speed improvement is obtained in practice as well, and the relative speed improvement increases with the desired accuracy (as suggested by the complexity bounds).
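To make the ε-equilibrium notion concrete for the matrix-game case, the following is a minimal sketch in plain Python, not the authors' implementation. It assumes Q1 and Q2 are probability simplices, so the saddle-point gap of a pair (x, y) is the largest entry of xᵀA minus the smallest entry of Ay, and the pair is an ε-equilibrium when that gap is at most ε. The function names and the reduction factor `gamma` in `target_schedule` are hypothetical; the schedule only illustrates why an outer loop that shrinks the target geometrically needs a number of rounds logarithmic in 1/ε.

```python
from typing import List, Sequence

Matrix = Sequence[Sequence[float]]


def equilibrium_gap(A: Matrix, x: Sequence[float], y: Sequence[float]) -> float:
    """Saddle-point gap max_y' x^T A y' - min_x' x'^T A y over simplices.

    Assumes x and y are probability vectors (matrix-game case only).
    """
    m, n = len(A), len(A[0])
    # Payoff of each pure column against x: (x^T A)_j.
    xtA = [sum(x[i] * A[i][j] for i in range(m)) for j in range(n)]
    # Payoff of each pure row against y: (A y)_i.
    Ay = [sum(A[i][j] * y[j] for j in range(n)) for i in range(m)]
    # The maximizer's best response is a pure column, the minimizer's a pure row.
    return max(xtA) - min(Ay)


def is_eps_equilibrium(A: Matrix, x: Sequence[float], y: Sequence[float],
                       eps: float) -> bool:
    """True when (x, y) is an eps-equilibrium of the matrix game A."""
    return equilibrium_gap(A, x, y) <= eps


def target_schedule(eps0: float, eps: float, gamma: float = 2.0) -> List[float]:
    """Hypothetical outer-loop targets: divide by gamma until eps is reached."""
    targets: List[float] = []
    t = eps0
    while t > eps:
        t = max(t / gamma, eps)  # never go below the final accuracy
        targets.append(t)
    return targets


# Matching pennies: the uniform strategies form an exact equilibrium.
pennies = [[1.0, -1.0], [-1.0, 1.0]]
uniform_gap = equilibrium_gap(pennies, [0.5, 0.5], [0.5, 0.5])
pure_gap = equilibrium_gap(pennies, [1.0, 0.0], [1.0, 0.0])
schedule = target_schedule(1.0, 0.1)
```

With these inputs the uniform pair has gap 0 and the pure pair has gap 2, and the schedule from 1.0 down to 0.1 takes four rounds. In the scheme described above, each such round would call the smoothed inner method with the current target; that inner solver is deliberately left out here.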


Similar resources

First-Order Algorithm with O(ln(1/ε)) Convergence for ε-Equilibrium in Two-Person Zero-Sum Games


A TRANSITION FROM TWO-PERSON ZERO-SUM GAMES TO COOPERATIVE GAMES WITH FUZZY PAYOFFS

In this paper, we deal with games with fuzzy payoffs. We prove that players who are playing a zero-sum game with fuzzy payoffs against Nature are able to increase their joint payoff, and hence their individual payoffs, by cooperating. It is shown that a cooperative game with the fuzzy characteristic function can be constructed via the optimal game values of the zero-sum games with fuzzy payoff...


Vehicle Routing Problem in Competitive Environment: Two-Person Nonzero Sum Game Approach

The vehicle routing problem (VRP) is one of the most important issues in transportation. Among VRP variants, the competitive VRP is especially important because of the tough competition between distributors and retailers. In this study we introduce new methods for the VRP in a competitive environment. In these methods, two-person nonzero-sum games are defined to choose an equilibrium solution. Therefore, revenue giv...


Distribution Design of Two Rival Decenteralized Supply Chains: a Two-person Nonzero Sum Game Theory Approach

We consider competition between two decentralized supply chain networks under demand uncertainty. Each chain consists of one risk-averse manufacturer and a group of risk-averse retailers. The two chains offer substitutable products to geographically dispersed markets. The markets’ demands are contingent upon the prices, service levels, and advertising efforts of the two supply chains. We formulat...


A simple and numerically stable primal-dual algorithm for computing Nash-equilibria in sequential games with incomplete information

We present a simple primal-dual algorithm for computing approximate Nash equilibria in two-person zero-sum sequential games with incomplete information and perfect recall (like Texas Hold’em poker). Our algorithm performs only basic iterations (i.e., matrix-vector multiplications, clipping, etc., with no calls to external first-order oracles and no matrix inversions) and is applicable to a broad class...


Journal title:

Math. Program.

Volume 133

Publication date 2008
